Modularity in Inductively-Learned Word Pronunciation Systems

Authors

  • Antal van den Bosch
  • A. J. M. M. Weijters
  • Walter Daelemans
Abstract

In leading morpho-phonological theories and state-of-the-art text-to-speech systems it is assumed that word pronunciation cannot be learned or performed without intermediate analyses at several abstraction levels (e.g., morphological, graphemic, phonemic, syllabic, and stress levels). We challenge this assumption for the case of English word pronunciation. Using IGTree, an inductive-learning decision-tree algorithm, we train and test three word-pronunciation systems in which the number of abstraction levels (implemented as sequenced modules) is reduced from five, via three, to one. The last of these systems, which classifies letter strings directly as mapping to phonemes with stress markers, yields significantly better generalisation accuracies than the two multimodule systems. Analyses of empirical results indicate that positive utility effects of sequencing modules are outweighed by cascading errors passed on between modules.
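To make the single-module setup more concrete, the sketch below shows one common way to turn a word into per-letter classification instances: each letter is presented inside a fixed-width window of neighbouring letters and paired with a combined phoneme-and-stress label. This is only an illustration under assumptions. The window width (three letters on each side), the padding symbol, the function names, and the example labels are hypothetical and are not taken from the paper, and no learner such as IGTree is implemented here.

```python
def letter_windows(word, left=3, right=3, pad="_"):
    """Return one fixed-width letter window per letter of `word`.

    The focus letter sits at index `left` of each window; positions
    beyond the word boundary are filled with `pad` (an assumed symbol).
    """
    padded = pad * left + word + pad * right
    width = left + 1 + right
    return [padded[i:i + width] for i in range(len(word))]


def make_instances(word, labels, left=3, right=3):
    """Pair each letter window with its class label.

    `labels` must hold exactly one label per letter. Here a label is a
    placeholder string standing for a phoneme plus a stress marker.
    """
    if len(word) != len(labels):
        raise ValueError("need exactly one label per letter")
    return list(zip(letter_windows(word, left, right), labels))


# Hypothetical labels: stress marker followed by a phoneme symbol,
# with "-" standing for a letter that maps to no phoneme of its own.
example = make_instances("book", ["1b", "0u", "0-", "0k"])
for window, label in example:
    print(window, "->", label)
```

Any classifier that accepts fixed-length symbolic feature vectors could then be trained on such instances; in a multimodule system, each module would instead consume the previous module's output, which is where errors can cascade.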

Similar resources

Computer Assisted Pronunciation Teaching (CAPT) and Pedagogy: Improving EFL learners’ Pronunciation Using Clear Pronunciation 2 Software

This study examined the impact of Clear Pronunciation 2 software on teaching English suprasegmental features, focusing on stress, rhythm and intonation. In particular, the software covers five topics in relation to suprasegmental features including consonant cluster, word stress, connected speech, sentence stress and intonation. Seven Iranian EFL learners participated in this study. The study l...

Machine Learning of Word Pronunciation: The Case Against Abstraction (1999, pp. 2123-2126)

Word pronunciation can be learned by inductive machine learning algorithms when it is represented as a classification task: classify a letter within its local word context as mapping to its pronunciation. On the basis of generalization accuracy results from empirical studies, we argue that word pronunciation, particularly in complex spelling systems such as that of English, should not be modell...

Data-driven lexical modeling of pronunciation variations for ASR

In this paper a method for the automatic construction of a lexicon with multiple entries per word is described. The basic idea is to transform a reference word transcription by means of stochastic pronunciation rules that can be learned automatically. This approach already proved its potential (Cremelie & Martens, 1999), and is now brought to a much higher level of performance. Relative reducti...

Generation of Word Pronunciation Networks from Automatically Learned Inter-word Coarticulation Rules

In this paper a method for learning inter-word coarticulation rules from a training set is proposed. The algorithm is based on a comparison of the standard transcription (i.e. 'how should it be pronounced') of each utterance with its auditory transcription (i.e. 'what was actually pronounced'). It is shown that the latter transcriptions can be obtained without human intervention: the speech ...

A machine learning approach to Swedish word pronunciation

This study focuses on word pronunciation in Text-to-Speech systems for Swedish. The purpose is to investigate whether machine learning techniques match knowledge-based systems in Swedish word pronunciation. The experiments show a maximum grapheme accuracy of just over 97%, and word accuracies from 67.0% for word pronunciation excluding stress assignment, which compares favourably to existing kn...

Publication date: 1998